Europäisches Patentamt
European Patent Office
Office européen des brevets

Publication number: 0 514 663 A2

EUROPEAN PATENT APPLICATION

Application number: 92106314.5
Date of filing: 11.04.92
Int. Cl.5: H04N 7/133
Priority: 24.05.91 US 705234
Date of publication of application: 25.11.92 Bulletin 92/48
Designated Contracting States: DE FR GB
Applicant: International Business Machines Corporation, Old Orchard Road, Armonk, N.Y. 10504 (US)
Inventor: Gonzales, Cesar A., RFD 4, Box 270, Foley Road, Katonah, New York 10536 (US)
Inventor: Viscito, Eric, 55 Mill Plain Road, Unit 18-4, Danbury, Connecticut 06811 (US)
Representative: Schäfer, Wolfgang, Dipl.-Ing., European Patent Attorney, IBM Deutschland GmbH, Schönaicher Strasse 220, W-7030 Böblingen (DE)

An apparatus and method for motion video encoding employing an adaptive quantizer.

An apparatus and method for encoding a video sequence is disclosed. The video sequence has a plurality of
pictures. The pictures each have a plurality of macroblocks. The macroblocks each have a plurality of
sub-blocks. The apparatus comprises a first module configured to generate a transform coefficient C_ij for
each of the sub-blocks of the macroblock. The apparatus further comprises a second module configured to
variably quantize the transform coefficient by a scaling factor q_p based on the complexity of the picture
and any rate control requirements.

The present invention relates generally to video imaging systems and methods. More particularly, the
present invention relates to a system and method for digital still picture and motion video compression.

Due to the current interest in digital multimedia interactive programs, there has been extensive activity
in the international standards bodies dealing with still picture and motion video compression. In particular,
the Motion Picture Experts Group (MPEG), a group working under the sponsorship of the International
Standards Organization, is rapidly converging towards a standard for the compression of digital motion
video. One of the interesting features of this standard is that only the "decoding" algorithm syntax will be
specified in detail. It will thus be possible to have different encoders, all of which produce bit streams
compatible with the standard's syntax, and yet result in different levels of video quality. The MPEG video
standard is fully described in ISO-IEC JTC1/SC2/WG11, MPEG 90/176 Rev. 2, December 18, 1990. This
reference is hereby incorporated in its entirety into this disclosure. The MPEG standard will be briefly
discussed herein.

Generally, the MPEG video standard defines a layered architecture for compressing a video sequence.
First, a sequence of video pictures is subdivided into disjoint Groups of Pictures (GOP). Each GOP is
compressed independently of other GOPs to facilitate random access to any picture and also to limit the
propagation of transmission errors.

Every picture in a GOP is subdivided into Macro-Blocks (MB). For a color picture, a MB is a collection
of 16 x 16 luminance pixels and two 8 x 8 blocks of chrominance pixels. In MPEG, the two chrominance
components are sampled at half the horizontal and vertical resolution of the luminance. As such, a MB
completely describes a 16 x 16 color segment of a picture. In a MB, the 16 x 16 luminance pixels are
further subdivided into four luminance blocks of 8 x 8 pixels.

MBs can be coded in two modes, namely intra and predictive. In intramode, a MB is coded
independently of pixel data in previous or future pictures. In predictive mode, a MB is coded with reference
to pixel data in a previous picture (forward prediction), a future picture (backward prediction), or both
(interpolative prediction). A prediction is formed by applying motion compensation techniques to the
referenced pictures, and the MB error data is generated by subtracting the prediction from the original pixel
data.

The MPEG standard requires that the first picture in a GOP be coded as an intrapicture. An intrapicture
is defined as having all of its MBs coded in the intramode. The remaining pictures of a GOP are then coded
as either unidirectional predictive pictures (i.e., its MBs are coded in a mixture of intramode and forward
prediction) or bidirectional predictive pictures (i.e., its MBs are coded in any of the MB coding modes).

The still or motion picture data in the form of MBs (represented by either the actual MB pixel data
(intramode) or only the error data (predictive)) to be compressed is then inputted to a first compression step.
This first compression step is a transformation applied by a 2-dimensional 8 x 8 Discrete Cosine Transform
(DCT) to each of the MB blocks.
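
As an illustration of this first compression step, the following minimal Python sketch computes a 2-dimensional 8 x 8 DCT directly from its definition. The function name dct_2d and the orthonormal scaling are assumptions made for the example; the text above does not fix a normalization, and a practical encoder would use a fast algorithm rather than this direct summation.

```python
import math

N = 8  # block size of the 8 x 8 transform


def dct_2d(block):
    """Orthonormal 2-D DCT-II of an N x N block given as a list of rows.

    The orthonormal scaling is an assumption of this sketch.
    """
    def alpha(u):
        return math.sqrt(1.0 / N) if u == 0 else math.sqrt(2.0 / N)

    coeffs = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            total = 0.0
            for x in range(N):
                for y in range(N):
                    total += (block[x][y]
                              * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                              * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            coeffs[u][v] = alpha(u) * alpha(v) * total
    return coeffs


# A perfectly flat block has all of its energy in the DC coefficient.
flat_block = [[100] * N for _ in range(N)]
flat_coeffs = dct_2d(flat_block)
```

With this scaling, the flat block of value 100 yields a DC coefficient of 800 and AC coefficients that vanish up to floating-point rounding, which is the "little energy" case that the later energy measure relies on.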

After applying the DCT to the six blocks in a MB, MPEG suggests that the resulting transform
coefficients undergo a second compression step. This second compression step is a scaling and
truncation step (referred to in the art as "quantization"). Each of the DCT coefficients is uniformly
quantized with a matrix of quantization steps. MPEG specifies one of two reference matrices from which the
quantization steps may be derived. The choice of matrix depends on the MB mode. This second step
is an additional compression step which is necessary to achieve adequate compression of the picture data.
Although the reference matrices can be defined by the encoder at the beginning of a video sequence, they
remain fixed afterwards. MPEG allows dynamic changes to the matrix of quantization steps, however, by
allowing a scaling factor for the reference matrices; this scaling factor can be changed for every MB.
MPEG, however, does not disclose an apparatus or method for determining and changing the scaling factor.
Keeping this scaling factor constant may result in the unnecessary loss of picture quality during the
compression mode.

The present invention is directed to providing an apparatus and method for determining and changing
the scaling factor left undefined by MPEG. An apparatus and method that performs this function is defined
as adaptive quantization (AQ). As such, the present invention is an apparatus and method for encoding still
and motion pictures employing an adaptive quantization feature to the transform coefficients for improved
quality of still pictures and motion video compression.

The adaptation is performed on a MB to MB basis and varies based on the complexity of the image 
and the available rate control requirements. 

In one embodiment, the encoder of the present invention comprises a transform coefficient module and
an adaptive quantization module. The transform coefficient module of the preferred embodiment employs a
conventional Discrete Cosine Transform (DCT) function to generate a transform coefficient C_ij for each
sub-block of a macroblock.

Each transform coefficient C_ij is then inputted to the adaptive quantization module where it is variably
quantized based on (1) the complexity of the image and (2) any rate control requirements that may be
present in the encoding apparatus.

In one embodiment, the adaptive quantization module is configured to perform a first step quantization
followed by a second step quantization. In the first step, each transform coefficient C_ij is scaled by a fixed
weighting factor matrix to yield a partially quantized coefficient Cw_ij. Two matrices can be defined in MPEG.

The second quantization step is that of scaling Cw_ij by a second scaling factor q_p which remains
constant for all Cw_ij in a macroblock, but can vary from macroblock to macroblock, to yield a fully quantized
coefficient Cq_ij. In one embodiment, the scaling factor q_p is determined based on the complexity of the
picture (macroblock) as measured by Cw_ij and the rate control requirements.

With regard to the complexity of the picture, the adaptive quantization module will select the scaling
factor q_p based on a minimax algorithm applied to the Cw_ij coefficients of the four luminance blocks
contained in a MB.

In this regard, when the picture is complex the user will not be visually susceptible to image
abnormalities. As such, the quantization factor, q_p, may be high, thus reducing the number of bits used. In
contrast, when the picture is not complex, the user is susceptible to image abnormalities. As such, the
quantization factor, q_p, may be low, thus reducing compression and increasing the number of
bits used. The minimax algorithm senses the complexity of the picture and is used, in part, to determine q_p.

With regard to rate control requirements, encoders typically employ an equalizing buffer to ensure that
a mechanism exists for equalizing the variable bit rate at which data is generated by compression and the
constant bit rate that is typical of many storage media. The equalizing buffer has an upper and a lower rate
control requirement. The lower rate control requirement is such that a minimum amount of information must
always be stored in the buffer. The upper rate control requirement is such that only a maximum amount of
information can be stored in the buffer at a given time. As such, it is important that q_p ensure that the
occupancy of the buffer stays within these upper and lower bounds.

Accordingly, the present invention strikes a balance between maximization of the quality of the image 
while staying within the upper and lower boundaries of the rate control requirements. 

Although the present invention is described in the context of the MPEG standard, it should be clear that
the present invention is applicable to any video compression scheme that is transform based. The nature of
the transform (Discrete Cosine, Hadamard, Lapped Overlapped, etc.) and the size and/or structure of the
transform blocks and MB can be changed without affecting the fundamental ideas of this invention. For
example, a macroblock could be re-defined to contain fewer or more of the blocks defined in MPEG.

The foregoing and other objects, features and advantages of the invention will be apparent from the
following more particular description of (a) preferred embodiment(s) of the invention, as illustrated in the
accompanying drawing(s).

The following detailed description of the present invention will be more fully understood with reference
to the accompanying drawings in which:

FIGURE 1 is a high level block diagram of the motion picture encoder of the present invention;
FIGURE 2 is a block diagram showing the transformation coefficient module and adaptive quantization
module of the present invention;
FIGURE 3 is a more detailed block diagram showing the architecture of the adaptive quantization
module; and
FIGURE 4 is a more detailed block diagram showing the architecture of the q_p selection module.

The present invention is an apparatus and method for encoding of still or motion pictures. The encoder
of the present invention employs an adaptive quantization (AQ) feature such that the quantization of the
picture data is automatically varied to produce a high quality image while maintaining acceptable bit rate
control requirements.

FIGURE 1 illustrates a high level block diagram of the motion picture encoder 100 of the
present invention. As shown, the motion picture encoder 100 generally comprises an input
device 102 for inputting a sequence of motion pictures. The input device 102 may be, for example, the
output from a digital video cassette recorder or a digitized output from any analog image capture device,
such as a camera, VCR, and the like.

The digital motion picture is inputted by input device 102 along a bus 104 to an encoder 106. As will be
described more fully herein, the encoder 106 of the present invention is configured to perform a
transformation and quantization step. As will also be more fully described, the quantization step of the
present invention is based on the complexity of the image and the rate control requirements of a buffer 118
(to be described).

Further shown is a bit allocator 108. Bit allocator 108 is provided to allocate a specific number of bits
for each picture. Bit allocator 108 may take a number of configurations. In one embodiment, the bit allocator
108 may be configured to assign exactly the same number of bits to each picture. Alternatively, the bit
allocator 108 could be configured such that the number of bits allocated is dependent on the picture being
processed. For example, the MPEG group, in document ISO/IEC JTC1/SC2/WG11, MPEG 90/41, July 1990,
disclosed a method for variable bit rate allocation. This reference is hereby incorporated by reference in its
entirety into this specification. As will be shown more fully herein, the encoder 106 of the present invention
can be configured to operate with a variety of bit allocation strategies.

The compressed data picture is then outputted along a bus 112 to a variable length coder (VLC) 114
where the encoded information is represented by at least one bit. VLC encoding is well known in the art and
is a further compression step.

After the compressed data is coded by the VLC 114, it is outputted along a bus 116 to a buffer 118.
Buffer 118 is provided so as to equalize the variable bit rate of the compressed data being outputted by
VLC 114 on bus 116 with the constant bit rate of a storage device 122. A storage device, in this case, may
be, for example, any medium operating at a fixed data rate. For example, one common bit rate is that of a
CD-ROM, which operates at a bit rate of about 1.5 Mbit/s. As such, it is important to ensure (1) that there is
always enough data in the buffer 118 and (2) that buffer 118 does not overflow.

Further shown is a bus 124. Bus 124 allows the encoder 106 to sense the "fullness" (either too low or
too high) of buffer 118. As will be described more fully herein, the encoder 106 is configured to quantize the
transform coefficient by a scaling ratio which is determined based on the "fullness" of the buffer 118 and
the complexity of the video picture.

FIGURE 2 illustrates a high level block diagram of the encoder 106. As
shown, the encoder 106 first comprises a transform coefficient device 202.

The transform coefficient device 202 is provided to transform each MB into a corresponding transform
coefficient. In the preferred embodiment, the transform coefficient device 202 is configured to perform a
Discrete Cosine Transform (DCT) function. The DCT algorithm is well known in the art and will not be
described herein. As a result of the transform coefficient device 202, each block in an MB is transformed
into a set of transform coefficients C_ij.

The C_ij for each block is then outputted along a bus 204 to an adaptive quantization device 206. The
adaptive quantization device 206 is configured to quantize each transform coefficient C_ij by an appropriate
amount such as to achieve sufficient compression while maintaining image quality. The quantized transform
coefficient is represented by Cq_ij. The quantizing of each transform coefficient C_ij is thus not fixed and is
based on (1) the fullness of the buffer 118 and (2) the complexity of the image.

The mathematical foundation for the present invention will now be described. Generally, the process of
uniform quantization of an 8 x 8 matrix of transform coefficients C_ij can be described by the following
equation:

$$Cq_{ij} = \mathrm{INTEGER}\left[\frac{C_{ij}}{Q_{ij}} + \frac{k}{2}\right], \qquad i,j = 1,\dots,8 \qquad [1]$$

where INTEGER[x] extracts the integer portion of x, C_ij are the transform coefficients, Cq_ij are the
resulting quantized coefficients, and Q_ij are the corresponding quantization steps. The parameter k takes the
value of 1 for quantization with rounding to the nearest integer and 0 for truncation. Note that throughout this
application, positive transform coefficients are assumed. For negative coefficients, all formulas remain valid
if the magnitude is extracted first and the sign is restored after the quantization process is completed.

For use within the MPEG standard, Equation [1] can be rewritten in a different way by setting
Q_ij = q_p w_ij/8. In this case,

$$Cq_{ij} = \mathrm{INTEGER}\left[\frac{8\,C_{ij}}{q_p\, w_{ij}} + \frac{k}{2}\right], \qquad i,j = 1,\dots,8 \qquad [2]$$

where w_ij is a set of quantization weights and q_p is the quantization scaling factor. In the MPEG standard,
two integer matrices of w_ij can be defined to code a sequence. Additionally, in the MPEG standard q_p is
allowed to vary between 1 and 31 on a MB to MB basis. It is this variability that permits adaptive
quantization. The apparatus and method of the present invention determines the appropriate value for q_p. As
will be more fully described herein, the present invention automatically chooses q_p such as to optimize the
visual appearance of a video sequence while maintaining a constant average output data rate.
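
The quantization of Equation [2] can be sketched in a few lines of Python. This is an illustrative floating-point version only; the function names quantize and quantize_block are hypothetical, and the handling of negative coefficients follows the magnitude-then-sign rule stated above.

```python
def quantize(c, w, q_p, k=1):
    """Equation [2]: Cq = INTEGER[8*C / (q_p*w) + k/2].

    k = 1 rounds to the nearest integer, k = 0 truncates. The sign is
    removed before quantization and restored afterwards.
    """
    sign = -1 if c < 0 else 1
    return sign * int(8 * abs(c) / (q_p * w) + k / 2)


def quantize_block(block, weights, q_p, k=1):
    """Apply Equation [2] to every coefficient of a block (lists of rows)."""
    return [[quantize(c, w, q_p, k) for c, w in zip(c_row, w_row)]
            for c_row, w_row in zip(block, weights)]


fine = quantize(100, 16, 2)     # small q_p keeps more detail
coarse = quantize(100, 16, 31)  # large q_p discards more detail
```

For a coefficient of 100 with weight 16, q_p = 2 gives 25 while q_p = 31 gives 2, which shows how strongly the single per-macroblock factor q_p controls the coarseness of the result.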

The present invention attempts to distribute the available bits equally among all MBs in a picture. In this
manner, MBs with high energy would be assigned a coarser quantizer than those with low-energy content.
This assignment is in accordance with the results of many experiments with the human visual system that
suggest that humans are more tolerant to errors in areas of a picture with great activity or energy. As such,
q_p should be selected according to some measure of energy content for a block (small transform coefficients
C_ij correspond to little energy). An alternative embodiment would also take into account the energy of
neighboring MBs.

The assignment of q_p, however, cannot be arbitrary. In applications where the output bandwidth is fixed,
q_p must also be used to control the average compressed data rate. Typically, quantized coefficients
together with other data are coded by variable length codes. One consequence of this coding is that the
amount of data associated with a single video picture is not constant. In order to equalize the variable data
rate at which compressed data is generated and the output data rate at which it is transmitted, the data must be
buffered in the buffer device 118. Typically, the output data rate is constant. However, the apparatus and
method of the present invention is equally applicable to embodiments where the output data rate is variable.
Once the size of buffer 118 is chosen, the rate at which the compressed data is generated must be
regulated so that the buffer 118 does not overflow or underflow (become empty). This rate control is also
accomplished by dynamically modifying q_p. The encoder 100 is capable of adapting q_p to improve the
overall quality of a picture while simultaneously satisfying the buffer 118 size constraints.
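
The buffer constraint described above can be illustrated with a small occupancy model. The sketch below is an assumption-laden toy, not the rate control of the invention: the function name simulate_buffer, the buffer size, the starting occupancy and the per-picture bit counts are all hypothetical. Only the drain of 50,000 bits per picture follows from figures in this document (about 1.5 Mbit/s at 30 pictures/s).

```python
def simulate_buffer(bits_per_picture, drain_per_picture, size, start=0):
    """Track equalizing-buffer occupancy picture by picture.

    Each picture adds a variable number of coded bits; the channel removes
    a constant number. Returns a list of (occupancy, status) pairs.
    """
    occupancy = start
    trace = []
    for produced in bits_per_picture:
        occupancy += produced - drain_per_picture
        if occupancy > size:
            status = "overflow"    # more data than the buffer can hold
        elif occupancy < 0:
            status = "underflow"   # nothing left to transmit
        else:
            status = "ok"
        trace.append((occupancy, status))
    return trace


# Hypothetical picture sizes in bits; 1.5 Mbit/s at 30 pictures/s drains 50,000 bits each.
trace = simulate_buffer([80000, 80000, 20000, 10000, 10000, 10000],
                        drain_per_picture=50000, size=100000, start=50000)
```

In this toy run the second picture overflows the buffer and the last one underflows it; an encoder would raise q_p ahead of the first event and lower it ahead of the second.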

In the preferred embodiment, the output data rate has a target rate of around 1.5 Mbit/s with a video
resolution of 352 x 240 x 30 pixels/s. As will become apparent to one of ordinary skill in the art, the present
invention can be extended in trivial ways to operate at higher resolutions and bit rates.

In one embodiment, the adaptive quantization device 206 operates to split the quantization process of
Equation 2 into two steps: a quantization by w_ij followed by a quantization by q_p. In between these two
steps, the value of q_p is selected based on the results of the first step as well as on the fullness of the
output data rate equalizing buffer 118.

Referring now to FIGURE 3, a block diagram illustrates one embodiment of the adaptive quantization
device 206.

As shown by Figure 3, the transform coefficients C_ij are inputted to a first multiplier 302 where they are
multiplied by 2^n/w_ij, which quantity is inputted by a block 304 along a bus 306. This is the first quantization
step and yields a partially quantized coefficient Cw_ij. The scaling factor 2^n can be selected such that enough
precision is maintained even if all operations are carried out with integer arithmetic. Since the values of w_ij
do not change for a video sequence, the multiplicative factors 2^n/w_ij will also remain constant and they
need only be computed once at the beginning of a sequence. The pre-computed values could be stored with
arbitrary precision in a table of 64 integers (one for each of the 8 x 8 transform coefficients). In MPEG, two
w_ij matrices are allowed and therefore two tables are needed.

Thereafter, the output of multiplier 302 (Cw_ij) is inputted to a second multiplier 310 where Cw_ij is
multiplied by 2^q/q_p, denoted by a block 312. This is the second quantization step. However, unlike the first
quantization step, q_p is variable and is determined by a q_p selection device 314. The scaling factor 2^q can be
selected such that enough precision is maintained even if all operations are carried out with integer
arithmetic. As will be shown more clearly below, quantization by q_p can be carried out through multiplication
by one of 31 pre-stored integer values corresponding to all possible 2^q/q_p values in MPEG. In this case, the
q exponent determines the precision of the results of this step. There are extensions of this idea that
further improve the precision of the arithmetic operations without increasing the number of bits required for
the intermediate quantization results. For example, one could split the multiplication by 2^n/w_ij into a
multiplication by 2^(n+m)/w_ij followed by normalization (division or binary shift) by 2^m. Of course, if floating
point arithmetic is available, no power of two scaling is necessary.

As such, the quantization step defined mathematically by Equation [2] and illustrated in Figure 3 is split
into two separate steps. The first step is mathematically expressed as:

$$Cw_{ij} = \mathrm{INTEGER}\left[\frac{2^{n}}{w_{ij}}\, C_{ij}\right], \qquad i,j = 1,\dots,8 \qquad [3]$$

where Equation 3 is carried out in the multiplier 302. The second step can be expressed as:

$$Cq_{ij} = 2^{3-n-q}\,\mathrm{INTEGER}\left[\frac{2^{q}}{q_p}\, Cw_{ij} + 2^{\,n+q-4}\,k\right], \qquad i,j = 1,\dots,8 \qquad [4]$$

where Equation 4 is carried out by the multiplier 310, adder 320, and multiplier 328. In Equation 4, the value
of q_p is chosen based, at least in part, on the values of Cw_ij and thus the energy level (or complexity) of the
macroblock.

It should be understood that if a floating point processor is used, n and q can be set equal to zero.

The resultant quantized transform coefficient Cq_ij is then outputted on the bus 112 to the VLC device
114.
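
The two-step integer quantization of Equations 3 and 4 can be sketched as follows. The exponents n = 8 and q = 8 (N_EXP and Q_EXP below), the rounding of the table entries, and all function names are assumptions chosen for the example; the text only requires that the powers of two give enough precision. Coefficients are taken to be non-negative magnitudes, as assumed throughout.

```python
N_EXP = 8  # exponent n of the first scaling factor 2^n (assumed value)
Q_EXP = 8  # exponent q of the second scaling factor 2^q (assumed value)

# 31 pre-stored integer factors approximating 2^q / q_p for q_p = 1..31.
QP_FACTORS = {q_p: round((1 << Q_EXP) / q_p) for q_p in range(1, 32)}


def weight_factor(w):
    """Integer approximation of 2^n / w, computed once per sequence."""
    return round((1 << N_EXP) / w)


def first_step(c, w):
    """Equation 3: partially quantized coefficient Cw (still scaled by 2^n)."""
    return c * weight_factor(w)


def second_step(cw, q_p, k=1):
    """Equation 4: multiply by 2^q/q_p, add rounding term, shift by n+q-3 bits."""
    rounding = k << (N_EXP + Q_EXP - 4)          # equals k/2 after the final shift
    return (cw * QP_FACTORS[q_p] + rounding) >> (N_EXP + Q_EXP - 3)


cw = first_step(100, 16)       # magnitude 100 with weight 16
cq = second_step(cw, q_p=2)    # fully quantized coefficient
```

For this input the two-step integer path gives 25, the same value as the direct evaluation of Equation [2]. Because the table entries are rounded integers, other inputs may differ slightly from the floating-point result, which is the precision trade-off governed by n and q.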

As discussed before, q_p is selected on the basis of the energy content of transform blocks, and
alternatively their neighborhood, as well as on the basis of buffer 118 fullness and bit allocation
considerations. The higher the value of q_p, the more compressed the picture becomes. As such, fewer bits need
to be allocated, but at the cost of image quality. For high energy pictures, it has been found that the user
cannot easily notice image abnormalities. In contrast, for low energy pictures the user can easily notice
image abnormalities. As such, q_p should be as low as possible for low energy pictures, thus reducing
compression. In the case of MPEG, because an MB naturally defines a neighborhood of four 8 x 8
luminance blocks, it constitutes a natural unit for measuring energy. However, because of rate control
requirements, q_p has an upper and lower limit. As such, it is desirable to maximize the image quality only
when needed while staying within the rate control limits.

The q_p selection device 314 is configured to operate generally as follows:

1. Select a first quantizer factor, q_p^0, in the range between 1 and 31 on the basis of bit allocation and
buffer 118 fullness control considerations. The actual method of selection is not important and could be
one of many possibilities. For example, the above-referenced document, MPEG 90/41, describes one
such method.

2. Select a second quantizer factor, q_p^low, on the basis of energy considerations for a MB. The preferred
method for this selection is based on the determination of the minimum of the maximum energy content
transform coefficient for a given macroblock. This can be expressed by the following steps:

a. For the 4 luminance blocks in an MB, obtain the maxima of the partially quantized Cw_ij coefficients
of each luminance block. This can be represented as

$$Cw^{b}_{max} = \max_{i,j}\left(Cw_{ij}\right), \qquad b = 1,\dots,4 \qquad [5a]$$

where the index b represents each of the four luminance blocks in a MB.

b. Determine Cw_minimax as

$$Cw_{minimax} = \min_{b}\left(Cw^{b}_{max}\right) \qquad [5b]$$

In other words, determine the minimum of the four maximum Cw^b_max partially quantized transform
coefficients (Cw_ij) corresponding to each luminance block in a MB.

c. Select q_p^low such that Cq_minimax is a predefined value, i.e., using Equation 4 for example,

$$Cq_{minimax} = 2^{3-n-q}\,\mathrm{INTEGER}\left[\frac{2^{q}}{q_p^{low}}\, Cw_{minimax} + 2^{\,n+q-4}\,k\right] \qquad [5c]$$

For the case of Cq_minimax = 2^m, the derivation of q_p^low can be simplified significantly if the rounding
factor is ignored (k = 0). In this case,

$$q_p^{low} = 2^{3-n-m}\, Cw_{minimax} \qquad [6]$$



3. The final q_p selection is based on q_p^low, appropriately bounded to satisfy rate control and other
constraints. For example, because q_p^0 is chosen on the basis of rate control considerations, a large value
of q_p^0 suggests that the rate-equalizing buffer 118 is close to full and that we should choose q_p ≈ q_p^0.
On the other hand, if the buffer 118 is far from full, any q_p ≈ q_p^low ≤ q_p^0 is acceptable. A heuristic that
works fairly well in MPEG where q_p varies between 1 and 31 is

$$q_p = \min\left\{ q_p^{0},\ \max\left\{ q_p^{low},\ \mathrm{INTEGER}\left[\frac{q_p^{0}}{\alpha}\right] \right\} \right\} \qquad [7]$$

where α > 1. In particular, for a rate around 1.5 Mbit/s, α = 2.5 works well.
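
The three selection steps above can be put together in a short Python sketch. It is a non-authoritative illustration under several assumptions: the function name select_qp is hypothetical, n = 8 is an assumed scaling exponent, m = 1 corresponds to a target Cq_minimax of 2, the result of Equation 6 is truncated to an integer, and the final clamp to the range 1 to 31 is added so that the result is always a legal MPEG value.

```python
def select_qp(lum_blocks_cw, qp0, n=8, m=1, alpha=2.5):
    """Pick q_p for one macroblock from its four luminance blocks.

    lum_blocks_cw: four sequences of partially quantized magnitudes Cw_ij.
    qp0: rate-control value q_p^0 (1..31), obtained elsewhere.
    """
    cw_max = [max(block) for block in lum_blocks_cw]   # Equation 5a
    cw_minimax = min(cw_max)                           # Equation 5b
    qp_low = int(cw_minimax * 2.0 ** (3 - n - m))      # Equation 6 (truncated, assumption)
    qp = min(qp0, max(qp_low, int(qp0 / alpha)))       # Equation 7
    return max(1, min(31, qp))                         # clamp to legal range (assumption)


# Hypothetical Cw values: a busy macroblock and a smooth one.
busy = [[2560, 10], [5120, 3], [1280, 7], [3840, 1]]
smooth = [[128], [256], [512], [640]]
qp_busy = select_qp(busy, qp0=31)      # energy measure decides: 20
qp_smooth = select_qp(smooth, qp0=31)  # rate-control floor decides: 12
```

The busy macroblock has Cw_minimax = 1280, so Equation 6 gives 20, which lies between the bounds 12 and 31 and is used directly. The smooth macroblock would ask for 2, but the lower bound INTEGER[31/2.5] = 12 takes over because the large q_p^0 signals a nearly full buffer.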

As will be obvious to one skilled in the art, other heuristics are equally possible. An additional
configuration of the q_p selection device 314 could ensure that no quantized luminance coefficient exceeds a
maximum value. MPEG requires, for example, Cq_ij < 2^8. This requirement can be satisfied by ensuring that
the final q_p is at least as large as the value of q_p^min defined below

$$Cw_{maximax} = \max_{b}\left(Cw^{b}_{max}\right) \qquad [8]$$

$$q_p^{min} = 2^{3-n-8}\, Cw_{maximax} \qquad [9]$$

Obviously, these constraints can also be extended to the chrominance blocks.
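
A minimal sketch of this lower bound follows. The function names qp_min and apply_lower_bound, the exponent n = 8, and the use of a ceiling to obtain an integer q_p are assumptions for illustration; the text only states that the final q_p should be at least as large as q_p^min.

```python
import math


def qp_min(lum_blocks_cw, n=8, limit_bits=8):
    """Smallest q_p that keeps every quantized coefficient near the 2^limit_bits limit.

    Uses the largest of the per-block maxima (a max select in place of the
    min select of the minimax computation).
    """
    cw_maximax = max(max(block) for block in lum_blocks_cw)
    return cw_maximax * 2.0 ** (3 - n - limit_bits)


def apply_lower_bound(qp, bound):
    """Raise qp to the next integer at or above the bound (rounding is an assumption)."""
    return max(qp, math.ceil(bound))


# Hypothetical macroblock with one very large partially quantized coefficient.
blocks = [[200000, 5], [1200, 9], [800, 2], [40, 1]]
bound = qp_min(blocks)                     # 200000 / 8192 = 24.4140625
qp_final = apply_lower_bound(20, bound)    # raised from 20 to 25
```

With q_p = 25 the largest coefficient quantizes to 200000 / 32 / 25 = 250, which is below 2^8 = 256, whereas q_p = 20 would have produced about 312.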

Figure 4 shows the q_p selection device 314 in more detail.
As shown, the partially quantized transform coefficients Cw_ij are inputted along a bus 316 to a first latch
402. First latch 402 is provided to ensure that only one transform coefficient at a time is passed to a max
select module 406 (to be described). The first latch 402 only holds the data for one clock cycle.

The transform coefficients are then outputted along a bus 404 to the max select module 406. The max
select module 406 is provided to determine the maximum transform coefficient for each of the four
sub-blocks of each macroblock. Operation of max select module 406 is equivalent to the mathematical
expression of Equation 5a.

The second latch 410 is initialized to zero. After 64 cycles, the second latch 410 will contain the
maximum value among the first 64 transform coefficients Cw_ij of a single 8 x 8 pixel block, which
corresponds to one sub-block within a macroblock.

At this point, a 64 cycle counter 403 operates to clear the second latch 410 and likewise reset the
operation of the first latch 402 and first max select module 406. The first latch 402 and the first max select
module 406 are then ready for the next sequence of transform coefficients that represent the next sub-block
of the macroblock.

In short, the function of first max select module 406, first latch 402, and second latch 410 is simply to
pick the maximum transform coefficient Cw_ij of each sub-block within the macroblock.

The 64 cycle counter 403, after counting 64 cycles, knows that the second latch 410 contains the
maximum transform coefficient Cw_ij that corresponds to the current sub-block. As such, this maximum value
is outputted along a bus 412 to a first min select module 414. Min select module 414 is configured to
determine the minimum of the four maximum transform coefficients calculated for one
macroblock. This is defined as Cw_minimax. Operation of min select module 414 corresponds to the
mathematical relationship of Equation 5b.

In operation, third latch 420 is initialized to a very large value. Thereafter, every time the 64 cycle counter
403 reaches the count of 64 cycles, the min select module 414 is prompted to make one comparison. The
comparison is between the current value stored in the third latch 420 and the current maximum transform
coefficient Cw_ij being inputted to the min select module 414 along bus 412. As such, after four cycles of the
64 cycle counter 403, the third latch 420 will be loaded with the minimum value of the maximum
transform coefficients Cw_ij of the 4 blocks of a macroblock, defined as Cw_minimax. The 4 cycle counter 405 is
essentially the one that counts the 4 blocks. When the 4 luminance blocks of a macroblock have gone
through this process, the fourth latch 426 is loaded with Cw_minimax along a bus 424.
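
The serial operation of the latches and counters described above can be modelled in software as a streaming computation. The generator below is a behavioural sketch only, not a description of the actual circuit: the name minimax_stream is hypothetical, and resetting the minimum register after each macroblock is an assumption, since the text only states its initial value.

```python
def minimax_stream(coefficients, block_len=64, blocks_per_mb=4):
    """Yield Cw_minimax once per macroblock from a serial stream of Cw values."""
    max_latch = 0               # models second latch 410, initialized to zero
    min_latch = float("inf")    # models third latch 420, "a very large value"
    count_coeffs = 0            # models the 64 cycle counter 403
    count_blocks = 0            # models the 4 cycle counter 405
    for cw in coefficients:
        max_latch = max(max_latch, cw)              # max select module 406
        count_coeffs += 1
        if count_coeffs == block_len:
            min_latch = min(min_latch, max_latch)   # min select module 414
            max_latch = 0                           # cleared for the next sub-block
            count_coeffs = 0
            count_blocks += 1
            if count_blocks == blocks_per_mb:
                yield min_latch                     # value loaded into fourth latch 426
                min_latch = float("inf")            # reset for next macroblock (assumption)
                count_blocks = 0


# Shortened blocks of 2 coefficients keep the example readable.
stream = [1, 5, 7, 2, 3, 3, 9, 4,   2, 2, 2, 2, 2, 2, 2, 1]
results = list(minimax_stream(stream, block_len=2))
```

In the first macroblock of the example the per-block maxima are 5, 7, 3 and 9, so the value produced is 3; the second macroblock produces 2.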

At that point, Cw_minimax is outputted to a multiplier 430 where it is normalized to the correct power
denoted by block 432. The output of multiplier 430 is q_p^low. The operation of multiplier 430 corresponds to
the mathematical expression given by Equation 6.

The value for q_p^low is then outputted to a second max select module 436. The max select module 436 is
generally provided to ensure that the ultimate value chosen for q_p is not below the lower bounds of any rate
control requirements.

The output of the max select module 436 is a value of q_p greater than or equal to a value below which you
cannot go because of rate control requirements. If a lower value is selected, the rate equalizing buffer may
begin to overflow, thus causing information to be lost. If q_p^low goes below the lower bound, then the max
select module 436 will select instead the lower bound value for q_p calculated by the lower bound select
module 440 (to be described).

The value outputted by the max select module 436 is outputted to a min select module 444. The min
select module 444 is provided to ensure that an ultimate value for q_p is chosen that is less than or equal to a
value above which you cannot go because of rate control requirements. In other words, the min select
module 444 will not allow q_p^low to go above some maximum value q_p^0. In this case, the concern for rate
control is that of underflowing the equalizing buffer. It could be that the buffer is almost empty and it does
not have data to transmit. As such, the min select module 444 ensures that the final q_p selected will not be
above a certain value which is required for rate control. This value of q_p^0 is calculated from a measure of
buffer fullness. As discussed, one method for calculation of q_p^0 is described in MPEG 90/41.

Finally, the set of blocks incorporated in 460 can be used to implement Equations 8 and 9. We recall
that the purpose of these equations is to define a value q_p^min which is a lower bound for q_p that
guarantees that the final quantized coefficients Cq_ij will not exceed a pre-defined maximum. It will be
recalled that the devices 414, 420, 426, 430, and 432 implement the mathematical Equations 5b and 6. The
devices incorporated in module 460 are the same, except that a max select is used in place of a min select.
This is also the difference between Equations 5b and 6 and their corresponding Equations 8 and 9.
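
The role of the lower bound q_p^min can be illustrated with a short sketch. It assumes, purely for illustration, that the second quantization step is a plain division Cq_ij = Cw_ij / q_p with rounding ignored; the exact form of Equations 8 and 9 is not reproduced here. The names qp_min_bound, cw_blocks and cq_max are hypothetical.

```python
from typing import Sequence


def qp_min_bound(cw_blocks: Sequence[Sequence[float]], cq_max: float) -> float:
    """Smallest q_p that keeps every |Cw_ij / q_p| at or below cq_max.

    Assumes quantization is a plain division (rounding ignored).
    """
    # Largest partially quantized magnitude within each sub-block.
    block_maxima = [max(abs(c) for c in block) for block in cw_blocks]
    # Max select across sub-blocks, where the q_p^low path uses a min select.
    cw_maximax = max(block_maxima)
    return cw_maximax / cq_max
```

With two toy sub-blocks [10, -40] and [5, 20] and a maximum of 8, the largest magnitude is 40, so the bound is 40 / 8 = 5.0.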

It should be understood that the quality of intramode MBs is important for the overall video quality. In
particular, the quality of the MBs in an intrapicture of a GOP is crucial for determining the quality of the rest
of the pictures in that GOP. At low bit rates, the most objectionable distortion of many transform-based
schemes is the appearance of blockiness due to coarse quantization of the transform coefficients, which leads
to a mismatch of pixel intensities around the edges of the transformed blocks. This blockiness is most
visible in areas of the picture that are relatively smooth, i.e., where there is little luminance activity. Once
introduced, intrapicture blockiness tends to remain for the rest of a GOP; it is thus important to mitigate it in
those areas where it is most visually annoying. Areas of low luminance activity are characterized by the
low energy content in the AC coefficients of their DCT transform. The DC coefficient, however, only defines
the average value of a pixel block and not its activity. For this reason, the present invention applies the
algorithm of FIGURE 4 only to the 63 AC coefficients and not to the one DC coefficient. In addition, for
coding around 1.5 Mbit/s, the preferred value of Cq_minimax is 2.

Adaptive quantization in predictive MBs is slightly different. In this case, the DC coefficients should be
included in the process of FIGURE 4. For coding at around 1.5 Mbit/s, it is useful to apply adaptive
quantization only to the MBs in the forward predictive pictures. For this case, Cq_minimax = 1 is preferred.
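
The difference between intramode and predictive MBs can be sketched as follows. This is an illustration under stated assumptions rather than the disclosed implementation: index 0 of each sub-block is assumed to hold the DC coefficient, and q_p^low is assumed to be the ratio Cw_minimax / Cq_minimax, since Equations 5b and 6 are not reproduced here. The function names are hypothetical.

```python
from typing import Sequence


def cw_minimax(cw_blocks: Sequence[Sequence[float]], include_dc: bool) -> float:
    """Minimum over sub-blocks of the maximum coefficient magnitude."""
    start = 0 if include_dc else 1  # index 0 is assumed to be the DC coefficient
    block_maxima = [max(abs(c) for c in block[start:]) for block in cw_blocks]
    return min(block_maxima)


def qp_low(cw_blocks: Sequence[Sequence[float]], intra: bool) -> float:
    """Assumed form: q_p^low = Cw_minimax / Cq_minimax."""
    # Preferred values around 1.5 Mbit/s: 2 for intramode MBs, 1 for predictive MBs.
    cq_minimax = 2 if intra else 1
    # Intramode MBs use only the AC coefficients; predictive MBs include the DC.
    return cw_minimax(cw_blocks, include_dc=not intra) / cq_minimax
```

Real sub-blocks carry 64 coefficients; the shortened blocks used to exercise this sketch are only for readability.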

While the invention has been particularly shown and described with reference to preferred embodiments 
thereof, it will be understood by those skilled in the art that the foregoing and other changes in form and 
details may be made therein without departing from the spirit and scope of the invention. 


Claims 

1. An apparatus for encoding of a video picture, the video picture having a plurality of pictures, the
pictures each having a plurality of macroblocks, the macroblocks each having a plurality of sub-blocks,
the apparatus comprising:

(a) a first module configured to generate transform coefficients C_ij for each of the sub-blocks of the
macroblock; and

(b) a second module configured to variably quantize said transform coefficients, said second module
being further configured to quantize said transform coefficients C_ij in a first quantization step
followed by a second quantization step, said first quantization step being quantization by a matrix of
weights in a predefined matrix to generate a set of partially quantized coefficients Cw_ij for each of
the sub-blocks of the macroblock, said second quantization step being quantization by a variable
scaling factor q_p based on the calculation of said Cw_ij for each sub-block in the macroblock.

2. The apparatus of claim 1, wherein said q_p is based on the calculation of Cw_minimax, where Cw_minimax is
the minimum of the maximum of the Cw_ij for each sub-block of the macroblock.

3. The apparatus of claims 1 or 2, wherein q_p is also based on the rate control requirements of an output
equalizing buffer.

4. The apparatus of claim 3, wherein said rate control requirements consist of an upper boundary and a
lower boundary for said scaling factor q_p.

5. The apparatus of claim 4, wherein q_p is selected from among said Cw_minimax and said upper and lower
bounds of said rate control requirements.

6. A method for encoding of a video picture, the video picture having a plurality of pictures, the pictures
each having a plurality of macroblocks, the macroblocks each having a plurality of sub-blocks, the
method comprising the steps of:

(a) generating a transform coefficient C_ij for each of the sub-blocks of the macroblock;

(b) generating a variable quantization factor Q_p, said step of generating a variable quantization factor
Q_p comprising the step of performing a first quantization step followed by a second quantization step,
said first quantization step being quantization by a fixed scaling weight w_ij to thereby generate a
partially transformed transform coefficient Cw_ij, said second quantization step being quantization by
a variable scaling factor q_p based on the calculation of said Cw_ij for each sub-block in the
macroblock; and

(c) scaling said transform coefficient C_ij by said quantization factor Q_p to generate a quantized
transform coefficient Cq_ij.

7. The method of claim 6, wherein step (b) further comprises the step of determining q_p based on the
calculation of a Cw_minimax, where said Cw_minimax is the minimum of the maximum of said Cw_ij for each
sub-block of the macroblock.

8. The method of claims 6 or 7, wherein step (b) further comprises the step of determining q_p based on
the rate control requirements of an output equalizing buffer.

9. The method of claim 8, wherein step (b) further comprises the step of determining the rate control
requirements based on an upper boundary and a lower boundary for said variable scaling factor q_p.

10. The method of claim 9, wherein step (b) further comprises the step of selecting q_p from among said
Cw_minimax and said upper and lower bounds of said rate control requirements.
